This paper provides an introductory survey of GPT-3. We cover the historical development behind this technology, the key features of GPT-3, and the underlying machine learning model and training datasets. We survey both academic and commercial efforts applying GPT-3 in diverse domains such as conversational AI chatbots, software development, creative work, domain knowledge, and business productivity. We discuss some of the challenges GPT-3 faces, such as training complexity, bias, and hallucination/incorrect answers. We also discuss future research opportunities in this area.
Automated Market Makers (AMMs) have cemented themselves as an integral part of the decentralized finance (DeFi) space. AMMs are a type of exchange that allows users to trade assets without the need for a centralized exchange. They form the foundation for numerous decentralized exchanges (DEXs), which facilitate the quick and efficient exchange of on-chain tokens. All present-day popular DEXs are static protocols with fixed parameters controlling the fee and the curvature; this invariance means they cannot adapt to rapidly changing market conditions. This can drive traders away during the high-slippage conditions brought about by intractable market movements. We propose a reinforcement learning (RL) framework to optimize the fees collected on an AMM protocol. In particular, we develop a Q-Learning Agent for Market Making Protocols (QLAMMP) that learns the optimal fee rates and leverage coefficients for a given AMM protocol, maximizing the expected fees collected under a range of market conditions. We show that QLAMMP consistently outperforms its static counterparts under all the simulated test conditions.
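To make the Q-learning component concrete, here is a minimal tabular sketch of fee selection. The fee tiers, state buckets, and hyperparameters are illustrative assumptions rather than QLAMMP's actual design, and the environment step that produces the reward (fees collected) is left abstract.

```python
import random
from collections import defaultdict

# Illustrative discretization; the paper's state and action spaces may differ.
FEE_TIERS = [0.0005, 0.001, 0.003, 0.01]   # candidate fee rates (actions)
ALPHA, GAMMA, EPSILON = 0.1, 0.95, 0.1     # learning rate, discount, exploration

Q = defaultdict(float)                     # Q[(state, fee)] -> value estimate

def choose_fee(state):
    """Epsilon-greedy action selection over candidate fee tiers."""
    if random.random() < EPSILON:
        return random.choice(FEE_TIERS)
    return max(FEE_TIERS, key=lambda fee: Q[(state, fee)])

def update(state, fee, reward, next_state):
    """One Q-learning backup; the reward is the fee revenue collected
    during the step under the chosen fee rate."""
    best_next = max(Q[(next_state, f)] for f in FEE_TIERS)
    Q[(state, fee)] += ALPHA * (reward + GAMMA * best_next - Q[(state, fee)])
```

In a simulation loop, one would call choose_fee on the current (discretized) market state, apply the chosen fee to the AMM for one interval, observe the fees collected as the reward, and call update.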
In multi-agent reinforcement learning, communication is crucial for encouraging cooperation among agents. Communication in realistic wireless networks can be highly unreliable, as network conditions vary with agent mobility and with randomness in transmission. We propose a framework to learn practical communication strategies by addressing three fundamental questions: (1) When: agents learn when to communicate based not only on message importance but also on wireless channel conditions. (2) What: agents augment message contents with wireless network measurements to better select game and communication actions. (3) How: agents use a novel neural message encoder that preserves all information from received messages, regardless of the number and order of messages. Simulating standard benchmarks under realistic wireless network settings, we achieve significant improvements in game performance, convergence speed, and communication efficiency compared to the state of the art.
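A minimal sketch of an order- and count-invariant message encoder, assuming a DeepSets-style sum aggregation in PyTorch (the paper's actual encoder architecture may well differ), could look like this:

```python
import torch
import torch.nn as nn

class MessageEncoder(nn.Module):
    """Permutation-invariant message encoder: a shared per-message MLP (phi)
    followed by a sum and a post-aggregation MLP (rho), so the output does
    not depend on the order or the number of received messages."""
    def __init__(self, msg_dim=16, hidden=64, out_dim=32):
        super().__init__()
        self.phi = nn.Sequential(nn.Linear(msg_dim, hidden), nn.ReLU(),
                                 nn.Linear(hidden, out_dim))
        self.rho = nn.Sequential(nn.Linear(out_dim, out_dim), nn.ReLU())

    def forward(self, messages):
        # messages: (num_messages, msg_dim); num_messages may vary per step,
        # and an empty batch reduces to a zero vector before rho.
        return self.rho(self.phi(messages).sum(dim=0))
```

The sum aggregation is what makes the encoding invariant: any permutation of the rows of `messages` yields the same output.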
Billions of wireless devices will be deployed in the near future, taking advantage of faster internet speeds and the possibilities enabled by many more endpoints. With the blooming of IoT devices, massive amounts of data that may contain users' private information will be generated. The high communication and storage costs, together with privacy concerns, increasingly challenge the traditional ecosystem of centralized over-the-cloud learning and processing for IoT platforms. Federated learning (FL) has emerged as the most promising alternative approach to this problem. In FL, training data-driven machine learning models is a collaborative act among multiple clients, without requiring the data to be brought to a central point, thus alleviating the communication and storage costs and providing a great degree of user-level privacy. We discuss the opportunities and challenges of FL for IoT platforms, as well as how it can enable future IoT applications.
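As a concrete illustration of such collaborative training, below is a minimal sketch of federated averaging (FedAvg), a common FL aggregation rule; the client parameters and dataset sizes are toy values, and the passage itself does not commit to any particular aggregation scheme.

```python
import numpy as np

def fed_avg(client_weights, client_sizes):
    """One FedAvg aggregation round: average client model parameters,
    weighted by local dataset size; raw data never leaves the device."""
    total = sum(client_sizes)
    return sum(w * (n / total) for w, n in zip(client_weights, client_sizes))

# Toy round with three IoT clients, each holding locally trained parameters.
clients = [np.array([0.9, 1.1]), np.array([1.0, 1.0]), np.array([1.2, 0.8])]
sizes = [100, 300, 600]
global_model = fed_avg(clients, sizes)   # weighted average parameter vector
```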
This paper presents SVAM (Sequential Variance-Altered MLE), a unified framework for learning generalized linear models under adversarial label corruption in training data. SVAM extends to tasks such as least squares regression, logistic regression, and gamma regression, whereas many existing works on learning with label corruptions focus only on least squares regression. SVAM is based on a novel variance reduction technique that may be of independent interest and works by iteratively solving weighted MLEs over variance-altered versions of the GLM objective. SVAM offers provable model recovery guarantees superior to the state-of-the-art for robust regression even when a constant fraction of training labels are adversarially corrupted. SVAM also empirically outperforms several existing problem-specific techniques for robust regression and classification. Code for SVAM is available at https://github.com/purushottamkar/svam/
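A minimal sketch of the variance-altered iteration for the least squares case, under our own illustrative reading of "iteratively solving weighted MLEs over variance-altered versions of the GLM objective" (the weighting, warm start, and schedule below are assumptions, not the paper's exact algorithm):

```python
import numpy as np

def svam_least_squares(X, y, iters=20, beta=0.5, xi=1.1):
    """Illustrative SVAM-style robust regression: weight each point by a
    Gaussian likelihood at the current scale (variance 1/beta), solve the
    weighted MLE in closed form, then sharpen the scale by a factor xi so
    adversarially corrupted points are progressively down-weighted."""
    w = np.linalg.lstsq(X, y, rcond=None)[0]        # vanilla LS warm start
    for _ in range(iters):
        r = y - X @ w
        s = np.sqrt(np.exp(-beta * r ** 2 / 2))     # per-point weights
        w = np.linalg.lstsq(X * s[:, None], s * y, rcond=None)[0]
        beta *= xi                                  # alter the variance
    return w
```

The geometric schedule on beta is the "variance-altered" part: early rounds weight points gently, and later rounds concentrate the fit on points the current model already explains well.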
The data used to train deep neural network (DNN) models in applications such as healthcare and finance typically contain sensitive information. A DNN model may suffer from overfitting. Overfitted models have been shown to be susceptible to query-based attacks such as membership inference attacks (MIAs). MIAs aim to determine whether a sample belongs to the dataset used to train a classifier (members) or not (nonmembers). Recently, a new class of label-based MIAs (LAB MIAs) was proposed, in which an adversary only requires knowledge of the predicted labels of samples. Developing a defense against an adversary carrying out a LAB MIA on DNN models that cannot be retrained remains an open problem. We present LDL, a lightweight defense against LAB MIAs. LDL works by constructing a high-dimensional sphere around queried samples such that the model decision is unchanged for (noisy) variants of the sample within the sphere. This sphere of label-invariance creates ambiguity and prevents a querying adversary from correctly determining whether a sample is a member or a nonmember. We analytically characterize the success rate of an adversary carrying out a LAB MIA when LDL is deployed, and show that the formulation is consistent with experimental observations. We evaluate LDL on seven datasets -- CIFAR-10, CIFAR-100, GTSRB, Face, Purchase, Location, and Texas -- with varying sizes of training data, all of which have been used by state-of-the-art (SOTA) LAB MIAs. Our experiments demonstrate that LDL reduces the success rate of an adversary carrying out a LAB MIA in each case. We empirically compare LDL with defenses against LAB MIAs that require retraining of DNN models, and show that LDL performs favorably despite not needing to retrain the DNNs.
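A minimal sketch of a label-invariance wrapper in the spirit of LDL; the sphere sampling, majority vote, radius, and sample count below are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def ldl_predict(predict_fn, x, radius=0.1, n_samples=32, seed=0):
    """Answer a query with the majority label over random points drawn on
    an L2 sphere of the given radius around x, so noisy variants of the
    sample all receive the same decision. Assumes x is a 1-D feature
    vector and predict_fn maps a batch of inputs to integer labels."""
    rng = np.random.default_rng(seed)
    noise = rng.normal(size=(n_samples, x.size))
    noise *= radius / np.linalg.norm(noise, axis=1, keepdims=True)
    labels = predict_fn(x[None, :] + noise)
    values, counts = np.unique(labels, return_counts=True)
    return values[np.argmax(counts)]
```

Because the returned label is constant across perturbations of x, an adversary probing the decision boundary near a queried sample learns little about its membership status; note this wraps inference only and requires no retraining.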
Very large language models such as GPT-3 have shown impressive performance across a wide variety of tasks, including text summarization. In this paper, we show that this strong performance extends to opinion summarization. We explore several pipeline methods for applying GPT-3 to summarize a large collection of user reviews in a zero-shot fashion, notably approaches based on recursive summarization and selecting salient content to summarize through supervised clustering or extraction. On two datasets, an aspect-oriented summarization dataset of hotel reviews and a generic summarization dataset of Amazon and Yelp reviews, we show that the GPT-3 models achieve very strong performance in human evaluation. We argue that standard evaluation metrics do not reflect this, and evaluate against several new measures targeting faithfulness, factuality, and genericity to contrast these different methods.
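As an illustration of the recursive pipeline, here is a minimal sketch in which the hypothetical llm_summarize callable stands in for a GPT-3 completion request and the fixed chunking policy is our assumption:

```python
def recursive_summarize(reviews, llm_summarize, chunk_size=10):
    """Summarize fixed-size chunks of reviews, then repeat on the
    intermediate summaries until a single summary remains.
    llm_summarize: callable mapping a list of texts to one summary."""
    texts = list(reviews)
    while len(texts) > 1:
        chunks = [texts[i:i + chunk_size]
                  for i in range(0, len(texts), chunk_size)]
        texts = [llm_summarize(chunk) for chunk in chunks]
    return texts[0]
```

The salient-content variants described above would replace the uniform chunking with a clustering or extraction step that selects which reviews reach the summarizer.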
Practical operations of coordinated fleets of mobile robots in different environments reveal benefits of maintaining small distances between robots as they move at higher speeds. This is counter-intuitive: as speed increases, larger distances would give robots more time to respond to sudden motion variations in surrounding robots. However, smaller inter-robot distances are desirable in settings such as autonomous trucks on highways, which save energy through vehicle drafting, or small robots in cluttered environments, which must stay close to maintain communication. This work introduces a model-based control framework that directly takes non-linear system dynamics into account. Each robot is able to follow more closely at high speeds because it makes predictions on the state information from its adjacent robots and biases its response by anticipating the adjacent robots' motion. In contrast to existing controllers, our non-linear model-based predictive decentralized controller achieves lower inter-robot distances at higher speeds. We demonstrate the success of our approach through simulation and hardware results on mobile ground robots.
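A toy sketch of the predictive idea for a one-dimensional, two-robot platoon; the constant-velocity leader model, cost weights, and discretized action set are illustrative simplifications (the paper's controller handles non-linear dynamics), not the authors' implementation.

```python
import numpy as np

def mpc_follow(x, v, lead_x, lead_v, horizon=10, dt=0.1, gap=2.0,
               accels=np.linspace(-3.0, 3.0, 13)):
    """Enumerate candidate accelerations, forward-simulate the follower over
    the horizon, predict the leader with a constant-velocity model, and pick
    the action minimizing gap-tracking cost plus a small control penalty."""
    best_a, best_cost = 0.0, float("inf")
    for a in accels:
        xs, vs, cost = x, v, 0.0
        for k in range(horizon):
            vs += a * dt
            xs += vs * dt
            lead_pred = lead_x + lead_v * dt * (k + 1)   # anticipated leader
            cost += (lead_pred - xs - gap) ** 2 + 0.01 * a ** 2
        if cost < best_cost:
            best_a, best_cost = float(a), cost
    return best_a
```

The key ingredient matching the abstract is the `lead_pred` term: the follower biases its action by anticipating its neighbor's motion rather than reacting only to the current measured gap.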
Federated learning provides an effective paradigm to jointly optimize a model that benefits from rich distributed data while protecting data privacy. Nonetheless, the heterogeneous nature of distributed data makes it challenging to define and ensure fairness among local agents. For instance, it is intuitively "unfair" for agents with high-quality data to sacrifice their performance due to other agents with low-quality data. Currently popular egalitarian and weighted equity-based fairness measures suffer from the aforementioned pitfall. In this work, we aim to formally represent this problem and address these fairness issues using concepts from cooperative game theory and social choice theory. We model the task of learning a shared predictor in the federated setting as a fair public decision making problem, and then define the notion of core-stable fairness: given $N$ agents, there is no subset of agents $S$ that can benefit significantly by forming a coalition among themselves based on their utilities $U_N$ and $U_S$ (i.e., $\frac{|S|}{N} U_S \geq U_N$). Core-stable predictors are robust to low-quality local data from some agents, and additionally they satisfy Proportionality and Pareto-optimality, two well sought-after fairness and efficiency notions within social choice. We then propose an efficient federated learning protocol, CoreFed, to optimize a core-stable predictor. CoreFed determines a core-stable predictor when the loss functions of the agents are convex. CoreFed also determines approximate core-stable predictors when the loss functions are not convex, as with smooth neural networks. We further show the existence of core-stable predictors in more general settings using Kakutani's fixed point theorem. Finally, we empirically validate our analysis on two real-world datasets, and we show that CoreFed achieves higher core-stability fairness than FedAvg while having similar accuracy.
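To illustrate the core-stability condition, here is a brute-force check over proper coalitions under the abstract's inequality; the coalition_utility oracle (each agent's utility under a predictor trained by $S$ alone) is a hypothetical stand-in, and CoreFed itself avoids this exponential enumeration.

```python
from itertools import combinations

def is_core_stable(n_agents, coalition_utility, grand_utilities):
    """Check the core condition from the abstract by enumeration: a proper
    coalition S blocks when (|S|/N) * u_i(S) >= u_i(grand) for every member
    i. coalition_utility(S, i) is a user-supplied oracle for agent i's
    utility under the predictor S would train on its own; exponential in
    the number of agents, so for illustration only."""
    agents = range(n_agents)
    for size in range(1, n_agents):            # proper subsets only
        for S in combinations(agents, size):
            if all((size / n_agents) * coalition_utility(S, i)
                   >= grand_utilities[i] for i in S):
                return False   # S prefers to defect: not core-stable
    return True
```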
Biological cortical networks are potentially fully recurrent networks without any distinct output layer, where recognition may instead rely on the distribution of activity across their neurons. Because such biological networks can have rich dynamics, they are well suited to cope with the kinds of dynamical interactions that occur in nature, while traditional machine learning networks may struggle to make sense of such data. Here we connected a simple model neuronal network (based on the 'linear summation neuron model' (LSM) featuring biologically realistic dynamics, consisting of 10 excitatory and 10 inhibitory neurons, randomly connected) to a robot finger with multiple types of force sensors that interacted with materials of different levels of compliance. Our aim was to explore the network's classification accuracy. We therefore compared the performance of the network output with principal component analysis of statistical features of the sensory data, as well as of the materials' mechanical properties. Remarkably, even though the LSM was a very small and untrained network, merely designed to provide rich internal network dynamics while the neuron model itself was highly simplified, we found that the LSM outperformed these other statistical approaches in terms of accuracy.
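A minimal sketch of such a small excitatory/inhibitory recurrent network; the leaky-tanh update, weight scaling, and three-channel sensory input are our simplifying assumptions standing in for the paper's linear summation neuron model.

```python
import numpy as np

rng = np.random.default_rng(0)

# 10 excitatory and 10 inhibitory neurons, randomly and recurrently connected.
N_E, N_I = 10, 10
N = N_E + N_I
W = rng.uniform(0.0, 1.0, size=(N, N))
W[:, N_E:] *= -1.0                                 # inhibitory columns negate
W *= 0.8 / np.max(np.abs(np.linalg.eigvals(W)))    # keep recurrence stable

W_in = rng.uniform(-0.5, 0.5, size=(N, 3))         # 3 assumed force channels

def run_network(sensor_seq, leak=0.9):
    """Leaky recurrent dynamics: each neuron integrates recurrent and sensory
    drive with decay constant `leak`; tanh bounds activity and stands in for
    the paper's more detailed neuron model. Returns the activity trace whose
    distribution across neurons can be read out for classification."""
    a = np.zeros(N)
    trace = []
    for u in sensor_seq:                           # u: one force reading/step
        a = leak * a + (1.0 - leak) * np.tanh(W @ a + W_in @ u)
        trace.append(a.copy())
    return np.array(trace)
```

Classification would then operate on the trace (for example, its time-averaged activity pattern), consistent with recognition relying on the distribution of activity across neurons rather than a dedicated output layer.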